Demonstrating Laughter Detection in Natural Discourses
نویسندگان
چکیده
This work focuses on the demonstration of previously achieved results in the automatic detection of laughter from natural discourses. In the previous work features of two different modalities, namely audio and video from unobtrusive sources, were used to build a system of recurrent neural networks called Echo State networks to model the dynamics of laughter. This model was then again utilized to detect laughters from presented test data. The approach was used to confirm human labels of laughter and to detect laughter in previously unmarked data, which resulted in nice results in offline applications. As reported in a publication currently under revision accuracies of 90 % were achieved using the multi modal input data relying on modulation spectral features from one microphone and movement data of a 360 degree camera positioned in the middle of a conference table [1]. In this work however, we would like to show a proof of concept for the online and on the fly recognition of laughter performing close to real-time. The goal of this work is to use a previously trained model of laughter in a modular process engine environment, which is currently under development overcoming known difficulties of pattern recognition and information fusion tasks, to detect laughter from a continuous microphone input. The intended application may then be used in online applications such as robots interacting with humans or the evaluation of human to human communication. Furthermore, the detection of laughter is an integral part of the improvement of dialog systems towards an affect understanding machine, since laughter is an important part of a healthy and natural communication.
منابع مشابه
"Did you laugh enough today?" - Deep Neural Networks for Mobile and Wearable Laughter Trackers
In this paper we describe a mobile and wearable devices app that recognises laughter from speech in real-time. The laughter detection is based on a deep neural network architecture, which runs smoothly and robustly, even natively on a smartwatch. Further, this paper presents results demonstrating that our approach achieves state-of-the-art laughter detection performance on the SSPNet Vocalizati...
متن کاملAnalysis of the occurrence of laughter in meetings
Automatic speech understanding in natural multiparty conversation settings stands to gain from parsing not only verbal but also non-verbal vocal communicative behaviors. In this work, we study the most frequently annotated non-verbal behavior, laughter, whose detection has clear implications for speech understanding tasks, and for the automatic recognition of affect in particular. To complement...
متن کاملDecision-Level Fusion for Audio-Visual Laughter Detection
Laughter is a highly variable signal, which can be caused by a spectrum of emotions. This makes the automatic detection of laughter a challenging, but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audiovisual laughter detection is performed by fusing the results of separate audio and video classifiers on the decision level. This r...
متن کاملLaughter Classification Using Deep Rectifier Neural Networks with a Minimal Feature Subset
Laughter is one of the most important paralinguistic events, and it has specific roles in human conversation. The automatic detection of laughter occurrences in human speech can aid automatic speech recognition systems as well as some paralinguistic tasks such as emotion detection. In this study we apply Deep Neural Networks (DNN) for laughter detection, as this technology is nowadays considere...
متن کاملFusion for Audio-Visual Laughter Detection
Laughter is a highly variable signal, and can express a spectrum of emotions. This makes the automatic detection of laughter a challenging but interesting task. We perform automatic laughter detection using audio-visual data from the AMI Meeting Corpus. Audio-visual laughter detection is performed by combining (fusing) the results of a separate audio and video classifier on the decision level. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009